RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 09 / 9%(> 

Source: /(oOO 

Date Processed by STIC: 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 



FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: Datin21help@uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM, ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE SEE BELOW: 



^ - Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST. 25 . 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 

Checker Version 3.0 can be down loaded from the USPTO website at the following address: 

htt^://www.uspto.gov/web/offices/pac/checker 



• 



Raw Sequence Listing Error Summary ^ 

SirC flESTED CORRECTION SERIAL NUMBER: 

: PLEASE DISREGARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOF1 

The numberAext at ihc end of each line "wrapped** down to the next line. This may occur if your file 
was retrieved in a word processor after creating it Please adjust your right margin to .3; this will 
prevent "wrapping," 



ERROR DETECTED 

ATTN! NEW RULES CASES: 

1 ^ Wrapped Nudcics 
Wrapped Aminos 



I Invalid Line Length The rules require that a line not ciceed 72 characters in length, this includes white spaces. 



^Misaligned Amino 
Numbering 

Non-ASCII 



3 ' ' Variable Length 



Patentln 2.0 
"bug- 



Skipped Sequences 
"(OLD RULES) 



_ The numbering under each 5* amino acid is misaligned. Do not use tab codes between numbers; 
use space characters, instead. 

The submitted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission Is aaved In ASCII text. ' 

Sequence^) contain n't or Xaa's representing more than, one rcsT3ue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number or each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

A "bug** in Patentln version 2.0 has cau/ed &e <220>-<223> section to be missing from amino acid 

sequencers) . Normally, Pa\cm^h would* automatically, generate this section from the 

previously coded nucleic acid sequenc^' Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences. 

Sequence^)/ missing. If intentional, please insert the following lines for each skipped sequence: 

(2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) ^ 

(xi) SEQUENCE DESCRIPTIONS EQ ID NO:X: (insert SEQ ID NO where "X M is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:'Vcsponse to Include the skipped sequences. 



8 Skipped Sequences^ Sequences) 



(NEW RULES) 



_Use of n's or Xaa's 
(NEW RULES) 



missing. If Intentional, please insert the following lines for each skipped sequence. 



10 Invalid <213> 

Response 

11 Use of<220> 



<210> sequence id number 
<400> sequence id number 
000 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 

Per 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa's are present. 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xaa represents. 

Per 1.823 of Sequence Rules, the only valid <213> responses are: Unknown, Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <2 13> response is Unknown or 
^Artificial Sequence 



Sequence^) _ 



missing the <220> "Feature'* and associated numeric identifiers and responses. 



12 



Patentln 2.0 
" "bug" 



Use of <220> to <223> is MANDATORY if <2I3> "Organism" response is "Artificial Sequence" or 
"Unknown." Please explain source of genetic material in <220> to <223> section. 
(See "Federal Register," 06/01/1998, Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy lo Disk" function of Patentln version 2.0. this causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual means to copy file to floppy disk. 



13 



Misuse o(n \ 



n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used lo represent 
any value not specifically a nucleotide. 

AMC/MH - Biotechnology Systems Branch - 08/2 1/2001 
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1600 




RAW SEQUENCE LISTING DATE: 12/13/2001 

PATENT APPLICATION: US/09/657 , 986 TIME: 09:54:28 

Input Set : A:\Sequence_listing.asc 
Output Set: N:\CRF3\12132001\l657986.raw 

W--> 1( 

4 <110> v APPLICANT : Madison, Edwin L. 

5 Semple, Joseph Edward Does Not Comply 

6 coombs, Gary samuel Corrected Diskette Needed 

7 Reiner, John Eugene w ^ 

8 Ong, Edgar 0. 

9 Araldi, Gian Luca 

11 <120> TITLE OF INVENTION: Inhibitors of Serine Protease Activity of Matriptase or 

12 MTSP1 

14 <130> FILE REFERENCE: Corvas 255/049 

16 <140> CURRENT APPLICATION NUMBER: 09/657,986 

17 <141> CURRENT FILING DATE: 2000-09-08 
19 <160> NUMBER OF SEQ ID NOS : 10 
21 <170> SOFTWARE: Patentln version 3.0 



23 <210> SEQ ID NO: 1 ^M^f 

24 <211> LENGTH: 1452 j 0 j 



ERRORED SEQUENCES 



25 <212> TYPE: DNA 

26 <213> ORGANISM: Homo_sapiens 
28 <400> SEQUENCE: 1 

E--> 29 gttgttgggg gcacggatgc ggatgagggc gagtggccct ggcaggtaag 
30 cctgcatgct 60^- 



Go 



E--> 3 2 caacaacccc cgtgcctacg cctactcccg ctcaccggga ccgtccattc 
33 ggacgtacga 120 ■ 

E--> 



35 ctgggccagg gccacatctg cggtgcttcc ctcatctctc ccaactggct 

36 ggtctctgcc C 180 
E--> 3 8 gacccggtcc cggtgtagac gccacgaagg gagtagagag ggttgaccga 

39 ccagagacgg 240 
E--> 41 gcacactgct acatcgatga cagaggattc aggtactcag accccacgca 

42 gtggacggcc 300 
E--> 44 cgtgtgacga tgtagctact gtctcctaag tccatgagtc tggggtgcgt 

45 cacctgccgg 360 
E--> 4 7 ttcctgggct tgcacgacca gagccagcgc agcgcccctg gggtgcagga 

4 8 gcgcaggctc 420 
E--> 50 aaggacccga acgtgctggt ctcggtcgcg tcgcggggac cccacgtcct 

51 cgcgtccgag 480 
E--> 53 aagcgcatca tctcccaccc cttcttcaat gacttcacct tcgactatga 

54 catcgcgctg 540 
E--> 56 ttcgcgtagt agagggtggg gaagaagtta ctgaagtgga age tga tact 

57 gtagegegae 600 
E--> 59 ctggagctgg agaaacegge agagtacagc tccatggtgc ggcccatctg 

60 cctgccggac 660 
E--> 62 gacctcgacc tctttggccg tctcatgtcg aggtaccacg cegggtagae 

63 ggacggcctg 720 
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RAW SEQUENCE LISTING DATE: 12/13/2001 

PATENT APPLICATION: US/09/657,986 TIME: 09:54:28 

Input Set : A:\Sequence_listing.asc 
Output Set: N:\CRF3\12132001\I657986.raw 

E--> 65 gcctcccatg tcttccctgc cggcaaggcc atctgggtca cgggctgggg 

66 acacacccag 780 
E--> 68 cggagggtac agaagggacg gccgttccgg tagacccagt gcccgacccc 

69 tgtgtgggtc 840 
E--> 71 tatggaggca ctggcgcgct gatcctgcaa aagggtgaga tccgcgtcat 

72 caaccagacc 900 
E--> 74 atacctccgt gaccgcgcga ctaggacgtt ttcccactct aggcgcagta 

75 gttggtctgg 960 
E--> 77 acctgcgaga acctcctgcc gcagcagatc acgccgcgca tgatgtgcgt 

78 gggcttcctc 1020 j 
E--> 80 tggacgctct tggaggacgg cgtcgtctag tgcggcgcgt actacacgca 0/\A* 

81 cccgaaggag 1080 /)^^h^ ^ 
E--> 83 agcggcggcg tggactcctg ccagggtgat tccgggggac ccctgtccag S r 

84 cgtggaggcg 1140 
E--> 86 tcgccgccgc acctgaggac ggtcccacta aggccccctg gggacaggtc 

87 gcacctccgc 1200 . 
E--> 89 gatgggcgga tcttccaggc cggtgtggtg agctggggag acggctgcgc 

90 tcagaggaac 1260 
E--> 92 ctacccgcct agaaggtccg gccacaccac tcgacccctc tgccgacgcg 

93 agtctccttg 1320 
E--> 95 aagccaggcg tgtacacaag gctccctctg tttcgggact ggatcaaaga 

96 gaacactggg 1380 
E--> 98 ttcggtccgc acatgtgttc cgagggagac aaagccctga cctagtttct 

99 cttgtgaccc 1440 
E--> 101 gtatagcata tc 

102 1452 

196 <210> SEQ ID NO: 5 

197 <211> LENGTH: 28 

198 <212> TYPE: DNA 

199 <213> ORGANISM: Homo_sapiens /W-^ 1 ^^ 
201 <400> SEQUENCE: 5 

E--> 202 caccccttct tcaatgactt caccttcg 

203 28 

206 <210> SEQ ID NO: 6 

207 <211> LENGTH: 18 

208 <212> TYPE: DNA 

209 <213> ORGANISM: Homo_sapiens 
211 <400> SEQUENCE: 6 

E--> 212 tacctctcct. acgact.cc 
213 18 

216 <210> SEQ ID NO: 7 

217 <211> LENGTH: 25 

218 <212> TYPE: DNA 

219 <213> ORGANISM: Homo_sapiens 
221 <400> SEQUENCE: 7 

E--> 222 gaggttctcg caggtggtct ggttg 
223 25 

226 <210> SEQ ID NO: 8 

227 <211> LENGTH: 39 
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RAW SEQUENCE LISTING DATE: 12/13/2001 

PATENT APPLICATION: US/09/657,986 TIME: 09:54:28 

Input Set : A:\Sequence_listing.asc 
Output Set: N:\CRF3\12132001\I657986.raw 

228 <212> TYPE: DNA 

229 <213> ORGANISM: Homo_sapiens 
231 <400> SEQUENCE: 8 

E--> 23 2 ctcgagaaaa gagttgttgg gggcacggat gcggatgag 

233 39 

236 <210> SEQ ID NO: 9 

237 <211> LENGTH: 36 

238 <212> TYPE: DNA 

239 <213> ORGANISM: Homo_sapiens 
241 <400> SEQUENCE: 9 

E--> 24 2 gcggccgcac tataccccag tgttctcttt gatcca 
243 36 
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His Asp Gin Ser Gin Arg Ser Ala Pro Gly Val Gin Glu Arg Arg Leu 
65 70 75 80 

Lys Arg lie lie Ser His Pro Phe Phe Asn Asp Phe Thr Phe Asp Tyr 
85 90 95 

Asp lie Ala Leu Leu^ Glu Leu Glu Lys Pro Ala Glu Tyr Ser Ser Met 
100 105 110 

Val Arg Pro lie Cys Leu Pro Asp Ala Ser His Val Phe Pro Ala Gly 
115 120 125 

Lys Ala lie Trp Val Thr Gly Trp Gly His Thr Gin Tyr Gly Gly Thr 
130 135 140 

Gly Ala Leu lie Leu Gin Lys Gly Glu lie Arg Val lie Asn Gin Thr 
145 150 155 160 

Thr Cys Glu Asn Leu Leu Pro Gin Gin lie Thr Pro Arg Met Met Cys 
165 170 175 

Val Gly Phe Leu Ser Gly Gly Val Asp Ser Cys Gin Gly Asp Ser Gly 
180 185 190 

Gly Pro Leu Ser Ser Val Glu Ala Asp Gly Arg lie Phe Gin Ala Gly 
195 200 205 

Val Val Ser Trp Gly Asp Gly Cys Ala Gin Arg Asn Lys Pro Gly Val 
210 215 220 

Tyr Thr Arg Leu Pro Leu Phe Arg Asp Trp lie Lys Glu Asn Thr Gly 
225 230 235 240 

Val 



<210> 3 

<211> 23 

<212> PRT 

<213> Homo_sapiens 



<220> 

<221> misc feature 
<223 



>/ R=A, G; V=G, A,C; W=A,T; S=G,C; Y=C,T; H-A, T,C ) J \J , > 

" === ^ cl rth ^ ru^Jxf^ 

<400> 3 ' ^ , 

Thr Gly Gly Arg Thr lie Val Thr lie Trp Ser lie Gly Cys lie Arg ~* 

15 10 15 j 

<213> Homo_sapiens /2^^^ ' 



Cys lie Cys Ala Tyr Thr Gly 
20 

<210> 4 
<211> 30 
<212> PRT 



<220> 



Of /est, f ft J[ 



<221> misc feature ■ ~~ * ^ 

<223> ( /fc^ A,G; V=G,A,C; W=A,T; S=G,C; Y=C,T; H=A, T,C^ ^-^ /)4^>7A - ^t/j^C. 

<400> 4 / 

lie Gly Gly lie Cys Cys lie Cys Cys lie Ser Trp Arg Thr Cys lie 
15 10 15 

Cys Cys Tyr Thr lie Arg Cys Ala lie Gly His Arg Thr Cys 
20 25 30 

<210> 5 
<211> 28 
<212> DNA 
<213> Homo_sapiens 

<400> 5 

caccccttct tcaatgactt caccttcg 
28 




<210> 6 

<211> 18 

<212> DNA 

<213> Homo_sapiens 

<400> 6 

tacctctcct acgactcc 
18 



<210> 7 

<211> 25 

<212> DNA 

<213> Homo_sapiens 

<400> 7 

gaggttctcg caggtggtct ggttg 
25 



<210> 8 

<211> 39 

<212> DNA 

<213> Homo_sapiens 

<400> 8 

ctcgagaaaa gagttgttgg gggcaeggat geggatgag 
39 



<210> 9 

<211> 36 

<212> DNA 

<213> Homo_sapiens 

<400> 9 

gcggccgcac. tataccccag tgttctcttt gatcca 
36 



4 




VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/657,986 



DATE: 12/13/2001 
TIME: 09:54:29 



Input Set : A:\Sequence_listing.asc 
Output Set: N:\CRF3\12132001\I657986.raw 



L:l M:259 W: Allowed number of lines exceeded, (1) GENERAL INFORMATION: 
L:29 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted: 50 SEQ:1 
M:254 Repeated in SeqNo=l 

L:202 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:28 SEQ:5 
L:212 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:18 SEQ : 6 
L:222 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted:25 SEQ : 7 
L:232 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted: 39 SEQ : 8 
L:242 M:254 E: No. of Bases conflict, LENGTH : Input : 0 Counted: 36 SEQ: 9 
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